# End-to-end speech synthesis

Fb Tts
A Vietnamese text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Safetensors
akthangdz
1
0
Kinyarwandatts Female Voice
An end-to-end, deep-learning-based Kinyarwanda text-to-speech (TTS) system, trained with Coqui's TTS library and the YourTTS architecture.
Speech Synthesis Transformers Other
DigitalUmuganda
17
1
Nepali Male V1
Apache-2.0
Nepali male voice synthesis model based on VITS architecture, supporting high-quality text-to-speech functionality
Speech Synthesis Transformers Other
tuskbyte
78
0
Mms Tts Div Finetuned Md F01
This is a Transformer-based text-to-speech (TTS) model that supports Dhivehi language speech synthesis.
Speech Synthesis Transformers Other
alakxender
61
0
Vits Cmn
Apache-2.0
VITS is an end-to-end text-to-speech model based on adversarial learning and a conditional variational autoencoder; this checkpoint supports Chinese speech synthesis.
Speech Synthesis Transformers Chinese
BricksDisplay
21
4
Mms Tts Mah
Marshallese text-to-speech model developed by Meta, using VITS end-to-end architecture to support high-quality speech synthesis
Speech Synthesis Transformers
facebook
124
0
Mms Tts Cmo Script Khmer
A Central Mnong text-to-speech model developed by Meta, supporting conversion of text to natural speech
Speech Synthesis Transformers
facebook
142
1
Mms Tts Mos
Mossi text-to-speech model developed by Meta, based on VITS architecture, supporting end-to-end speech synthesis
Speech Synthesis Transformers
facebook
176
2
Mms Tts Cak Dialect Southcentral
A Kaqchikel (South Central dialect) text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project.
Speech Synthesis Transformers
facebook
4
0
Mms Tts Lnd
A Lundayeh (Lun Bawang, ISO 639-3 code lnd) text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project
Speech Synthesis Transformers
facebook
7
0
Mms Tts Trs
A text-to-speech model for Chicahuaxtla Triqui developed by Meta, part of the Massively Multilingual Speech (MMS) project.
Speech Synthesis Transformers
facebook
4
0
Mms Tts Ljp
The Lampung Api text-to-speech model developed by Meta, which is part of the MMS multilingual speech project
Speech Synthesis Transformers
facebook
4
0
Mms Tts Trn
A text-to-speech model for Trinitario (ISO 639-3 code trn) developed by Meta, which uses the VITS architecture for speech synthesis.
Speech Synthesis Transformers
facebook
4
0
Mms Tts Bgr
A Bawm Chin text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project.
Speech Synthesis Transformers
facebook
14
0
Mms Tts Sna
Shona text-to-speech model from Facebook's MMS project, implementing high-quality speech synthesis based on VITS architecture
Speech Synthesis Transformers
facebook
97
1
Mms Tts Smo
Samoan text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
21
0
Mms Tts Kir
A Kyrgyz text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis.
Speech Synthesis Transformers
facebook
149
4
Mms Tts Khm
Khmer text-to-speech model from Facebook's MMS project, implemented with VITS architecture for end-to-end speech synthesis
Speech Synthesis Transformers
facebook
217
7
Mms Tts Bam
A Bambara text-to-speech model developed by Meta, part of the Massively Multilingual Speech project, utilizing the VITS architecture for high-quality speech synthesis.
Speech Synthesis Transformers
facebook
87
4
Mms Tts Shn
A Shan text-to-speech model developed by Meta, part of the MMS project, supporting conversion of Shan text to natural speech.
Speech Synthesis Transformers
facebook
19
1
Mms Tts Azj Script Latin
North Azerbaijani (Latin script) text-to-speech model developed by Meta, part of the Massively Multilingual Speech project
Speech Synthesis Transformers
facebook
36
1
Mms Tts Pag
Pangasinan text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
18
0
Mms Tts Orm
Oromo text-to-speech model from Facebook's MMS project, implementing end-to-end speech synthesis based on VITS architecture
Speech Synthesis Transformers
facebook
498
4
Mms Tts Kaz
Kazakh text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
1,757
2
Mms Tts Sag
A Sango text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
25
1
Mms Tts Gbm
Garhwali text-to-speech model developed by Meta, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
18
0
Mms Tts Lao
A Lao text-to-speech model developed by Meta, using the VITS architecture for end-to-end speech synthesis
Speech Synthesis Transformers
facebook
486
1
Mms Tts Amh
Amharic text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
403
3
Mms Tts Ful
A Fula text-to-speech model developed by Meta as part of the Massively Multilingual Speech project, supporting the conversion of Fula text into natural speech.
Speech Synthesis Transformers
facebook
123
1
Mms Tts Fin
A Finnish text-to-speech model developed by Facebook, based on the VITS architecture, supporting high-quality Finnish speech synthesis.
Speech Synthesis Transformers
facebook
337
0
Mms Tts Ind
Indonesian text-to-speech model from Facebook's MMS project, implemented with VITS architecture for end-to-end speech synthesis
Speech Synthesis Transformers
facebook
1,462
11
Mms Tts Aka
Akan text-to-speech model developed by Facebook, based on VITS architecture, supporting high-quality speech synthesis.
Speech Synthesis Transformers
facebook
150
1
Mms Tts Swh
Swahili text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
161
9
Mms Tts Nan
Southern Min text-to-speech model released by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis Transformers
facebook
861
5
Mms Tts Mal
Malayalam text-to-speech model in Facebook's MMS project, implementing end-to-end speech synthesis based on VITS architecture
Speech Synthesis Transformers
facebook
307
2
Mms Tts Hat
Haitian Creole text-to-speech model developed by Meta, part of the Massively Multilingual Speech (MMS) project
Speech Synthesis Transformers
facebook
223
1
Mms Tts Por
Portuguese text-to-speech model from Facebook's MMS project, implementing high-quality speech synthesis based on the VITS architecture
Speech Synthesis Transformers
facebook
828
15
Vits Vctk
MIT
VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences. The model employs a conditional variational autoencoder (VAE) architecture, including a posterior encoder, decoder, and conditional prior module.
Speech Synthesis Transformers
kakao-enterprise
3,601
13
Vits Ljs
MIT
VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences.
Speech Synthesis Transformers
kakao-enterprise
1,127
41
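
Most of the checkpoints listed above are tagged Transformers and described as VITS models, so they can usually be loaded through the `VitsModel` class in the Hugging Face Transformers library. The sketch below is illustrative rather than authoritative. The helper `guess_repo_id` is hypothetical: it assumes a listing name such as "Mms Tts Kaz" from publisher "facebook" maps to the repository ID `facebook/mms-tts-kaz`, and that "Script"/"Dialect" qualifiers are joined to the following word with an underscore. Verify the ID on the model's own page before relying on it. `synthesize` assumes `torch` and `transformers` are installed and that the checkpoint ships VITS weights with a matching tokenizer, which may not hold for every entry (for example, the Coqui-trained model).

```python
def guess_repo_id(owner: str, display_name: str) -> str:
    """Guess a Hub repository ID from a listing name (assumed convention, not verified)."""
    words = display_name.lower().split()
    parts = []
    i = 0
    while i < len(words):
        # Assumption: qualifiers appear as "script_latin" / "dialect_southcentral".
        if words[i] in ("script", "dialect") and i + 1 < len(words):
            parts.append(f"{words[i]}_{words[i + 1]}")
            i += 2
        else:
            parts.append(words[i])
            i += 1
    return f"{owner}/{'-'.join(parts)}"


def synthesize(repo_id: str, text: str):
    """Return (waveform, sampling_rate) for `text` using a VITS checkpoint."""
    # Imported lazily so guess_repo_id works without these packages installed.
    import torch
    from transformers import AutoTokenizer, VitsModel

    tokenizer = AutoTokenizer.from_pretrained(repo_id)  # downloads on first use
    model = VitsModel.from_pretrained(repo_id)
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        waveform = model(**inputs).waveform  # shape: (batch, num_samples)
    return waveform[0], model.config.sampling_rate
```

For instance, `synthesize(guess_repo_id("facebook", "Mms Tts Kaz"), "...")` would download the Kazakh checkpoint on first use and return a one-dimensional waveform tensor together with its sampling rate. VITS uses a stochastic duration predictor, so repeated calls can produce slightly different audio unless the random seed is fixed.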